Telephone speech recognition using neural networks and hidden Markov models

نویسندگان

  • Dong-Suk Yuk
  • James L. Flanagan
چکیده

The performance of well trained speech recognizers using high quality full bandwidth speech data is usually degraded when used in real world environments In particular telephone speech recognition is extremely di cult due to the limited bandwidth of transmission channels In this paper neural network based adaptation methods are applied to telephone speech recognition and a new unsupervised model adaptation method is proposed The advantage of the neural network based approach is that the retraining of speech recognizers for telephone speech is avoided Furthermore because the multi layer neural network is able to compute nonlinear functions it can accommodate for the non linear mapping between full bandwidth speech and telephone speech The new unsupervised model adaptation method does not require transcriptions and can be used with the neural net works Experimental results on TIMIT NTIMIT corpora show that the performance of the proposed methods is comparable to that of recognizers retrained on telephone speech

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

Persian Phone Recognition Using Acoustic Landmarks and Neural Network-based variability compensation methods

Speech recognition is a subfield of artificial intelligence that develops technologies to convert speech utterance into transcription. So far, various methods such as hidden Markov models and artificial neural networks have been used to develop speech recognition systems. In most of these systems, the speech signal frames are processed uniformly, while the information is not evenly distributed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999